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Abstract 

A simple unified closed form derivation of the non-linearities of the Einstein, Yang-Mills and 
spinlcss (e.g., chiral) meson systems is given. For the first two, the non-linearities are required by 
locality and consistency; in all cases, they are determined by the conserved currents associated 
with the initial (linear) gauge invariance of the first kind. Use of first-order formalism leads 
uniformly to a simple cubic self-interaction. 

Introduction 

The Maxwell and Einstein fields are, respectively, the most and least linear of gauge theories. 
The electrical neutrality of the photon reflects the absence of self-interaction, while at the 
other extreme, the gravitational field equations are an infinite series in the metric due to the 
gravitational 'weight' of gravitons and of their interaction energy. Between these extremes stand 
theories with internal gauge symmetry, typified by the (spin 1) Yang-Mills field and by (spin 0) 
chiral Lagrangians. We wish to give a simple physical derivation of the non-linearity of these 
theories, using a now familiar argument (e.g., [1-6]) leading from the linear massless spin 2 field 
to the full Einstein equations. This argument, which stresses the self-interaction (rather than 
gauge invariance) aspects, proceeds by adjoining to the initially linear theory a source which 
is obtained from the free part itself, a further source due to this one, etc., thus introducing 
new, non-linear terms in the action. The various non-linearities are thereby exhibited as specific 
self-interactions. We shall present a unified derivation, based on use of first-order actions, of the 
non-linearities of the above fields. All of them will emerge precisely as having cubic Lagrangians 
of the same generic form. In particular, the Einstein equations will be derived in one (closed 
form) step, rather than as an infinite series. There, consistency implies universal (including 
self-) coupling, and therefore the equivalence principle. 

Metric Field 

The Einstein equations may be derived non-geometrically [1-6] by noting that the free mass- 
less spin 2 field equations, 

R^M-IKM^^g^M (i) 

whose source is the matter stress-tensor T^ v , must actually be coupled to the total stress-tensor 
including that of the 0-field itself. That is, while the free-field equations (1) are of course quite 
consistent as they stand, this is no longer the case when there is a dynamical system's T^ v 
as a source. For then the left side, which is identically divergenceless, is inconsistent with the 
right, since the coupling implies that T^ v v as computed from the matter equations of motion, 
is no longer conserved. To remedy this, 2 the stress tensor 2 9 tlL/ arising from the quadratic 

2 Consistency and linearity may also be reconciled if locality is abandoned [14]. 
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Lagrangian 2 L responsible for equation (1) is then inserted on the right. But the Lagrangian 3 L 
leading to these modified equations is then cubic, and itself contributes a cubic 3 6 fll ,. This series 
continues indefinitely, and sums (if properly derived!) to the full non-linear Einstein equations, 
G^(rj a f3 + 4>ap) = —kT^, which are an infinite series in the deviation cf)^ of the metric 
from its Minkowskian value r/^. Once the iteration is begun (whether or not a is actually 
present), it must be continued to all orders, since conservation only holds for the full series 
n 9^ u . Thus, the theory is cither left in its (physically irrelevant) free linear form (1), or 
it must be an infinite series. The actual process of inserting the 6^„ of the system at each 
step is the prototype of our method: the 'current' on the right is that generated by the initial 
constant gauge invariance of the theory. In this case, the 0^ are the coefficients of local Lorentz 
transformations, since the invariance is that under rigid Lorentz rotations. This procedure is 
necessary when-and only when-there is also an initial gauge invariance of the second kind (here 
5<pfiu = d^ v + <9„£ M ) which implies identical conservation of the free field part, although this 
invariance is in fact violated by the iteration procedure. The current is defined at each step by 
invariance under constant transformations. 

We now derive the full Einstein equations, on the basis of the same self-coupling requirement, 
but with the advantages that the full theory emerges in closed form with just one added (cubic) 
term, rather than as an infinite series, and that no special 'gauge' such as = need be 
introduced. This is made possible by use of first-order form, in which the metric and affinity 
are a priori independent, and by taking as initial variables the linearizations not of g M!y , but of 
g^ v , the contravariant metric density, 

We begin by recording the full Einstein action in first-order form 

/ = J d 4 xU = J d 4 xg^R^ (2) 

which yields the field equations 

I r\: I 

(3a) 




p = pa _ JL pa 1 pa _i_ pa pP pa pp 1 n /o"U \ 

Here { ™ } is the Christoffel symbol constructed from the metric, and r^„ and g^ v have been 
varied independently Note that the action is just cubic in these basic variables. The free 
massless spin 2 theory (linearized approximation) may be represented by the quadratic action 

i L = J d'xnQ = J d 4 x[h^(T^ ia iv) + »r (i%,r Q r^rg„)] (4) 

with field equations 

21%, - r,%T v rf ^ = h^ a h"^ I (5a) 

I" \j,v,a — 2 ^ f J -- u ~~ 2 ^"iM = ^ (^b) 

where T M = T^ a . It differs from (2) only in the replacement of g^ v by rf v in the cubic term. 
As all indices are moved by r]^, we need only keep track of the symmetry of W and of the 
bottom indices of T^ v . Differentiation of (5a) with respect to a yields the linear equation 

2i#„ = ubT - h^ av - h va ^ - \ Vltu nh aa = (6) 
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which are equivalent to (1), with the relation (p^ = —h^ + ^ n^haa. We now demand that 
equation (6) be augmented by the source: 3 t m „ = T^ v — \ rj^ u T aa , where T^ v is the stress-tensor 
of the linear action of equation (4). It is very simply computed in the usual (Rosenfeld) way 
as the variational derivative of I L with respect to an auxiliary contravariant metric density 
tp^ v , upon writing I L in 'generally covariant form', l L (r\ — ► ip), with respect to this metric. 
Note that this does not presuppose any geometrical notions, being merely a mathematical 
shortcut in finding the symmetric stress-tensor of I L . We could also obtain it by the (equivalent) 
(Belinfante) prescription of introducing local Lorentz transformations. The covariant action is 
simply (4) with rf LV — > ip^ v and ^-covariant derivatives in the hdT term: 

si L W = J d 4 x5r v [(h af3 r x - 2hpf>r a pX + brv%. 6 p x )(sc^/sri 

+ (r^-r^itj] (?) 

where C^p is the Christoffel symbol of the tp. We have chosen to let h^ v transform as a 
contravariant tensor density and as a tensor in this auxiliary space. Since we are only 
interested in getting 5I/Sip^ u at ip = t], it is straightforward to vary C*p, keeping only the 
linear terms ~ dip, to obtain 

= 5i L /sr u = (r a r£„ - r^rg„) - 

2a^ = d a [r,^{h x PT p Xa - \ h xx r a ) + {hr r a - W a T u - h va Y^ 

+ h a ^ v + r^) + h»r(r%, - v ap ) + h""^ - ry] . (8) 

We now assert that the action which leads to the desired equation R^ v = — t m „ is 

i = i l + [ d i xh^(K l/ r a vivt) . (9) 



Note that we have not added the full Wr^u, but rather used the simple part of t m „. We also 
note that, if our assertion is correct, no iteration will be needed as the cubic term in (9) is 
in fact V'-independent since is a density. Thus 5I/5ip^ v = SI L / '5ip pv , and (9) constitutes 
the full theory as it must, since it is precisely the Einstein action (2) with the identification 
g^v = j^-v _|_ y^v To check that (9) is correct, we compute from the field equations (3). 
These differ from the linear ones in two respects. The first is the (IT — IT) term in (3b), which 
is the simple part of our t m „. The second is that T is now the full Christoffel symbol, i.e, that 
(3a) reads 4 

-h^ a + (rT + h^Ta - W + h^)K P - W p + h v P)T^ p = (10) 

and contains bilinear hT terms, unlike (5a). If we differentiate (10) with respect to a after 
cycling on the indices and separate the linear and quadratic terms we find precisely 

2r^„ iQ — r^.„ — r„.^ = 2R^ U — 2(7^ (if) 

so that with (3b), the net result is just the desired one, 

K, = -r^ ■ (12) 

Thus, thanks to use of first-order form, together with the use of the natural variable g^ v , in 
terms of which the full Einstein action is cubic, the derivation involved just one direct step. Note 



3 It is equivalent and saves computation to work with instead of which is why the source is r M „ = Si/Sip^. 
Also, we set the proportionality constant k between R and r to unity, since it gets reabsorbed anyhow in the final 
redefinition g^ v = rf" + nh^" . Of course, k will appear in front of the matter stress tensor. 

4 The usual infinite non-linearity of the Einstein equations appears when (5a) is solved for V which involves the 
matrix inverse of (/i MI/ + "tf"). Note that rf v assures the existence of this inverse at infinity, where is assumed 
(like any other field) to vanish. 
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also that the two parts of r M1/ have a different origin both in the field equations and in the S/5ip 
procedure. The IT term is the direct part of SI /Sip and of the obvious quadratic contribution 
in the dT equation due to the cubic term. The 07^ part is more subtle: it is only for spin >1 
that the kinetic l pq" term is unavoidably -0-dependent and so contributes to the stress tensor. 
Thus for spin 1, the corresponding term is F >lv {d tl A v — d v A^), which is covariant as it stands, 
taking F^ v to be a density, and likewise we have the covariant form TT^d^cp for spin 0, with 7^ 
a density. However, it is well known that higher rank tensors e.g., symmetric second-rank ones 
must have explicit covariant derivatives. Likewise the a contribution in the field equations is 
due to the non-linearity of the T — h relation, so that it arises from the difference between T 
and its linear part (in h). 

Finally, we return to the coupling of matter. The matter source is taken initially to be 
the conserved current associated with invariance of the free matter system under rigid Lorentz 
transformations, namely T^(r]), and does not, at this stage, depend on h^ v . It is easy to show 
that the correct coupling is according to the usual minimal prescription I M (jf v ) — > I M ' {r\^ v + 
h^ v ): For, on the one hand, the right side of the Einstein equation (12) is to be = 5Im(v + 
hj/Sipv at ip — 0, namely the total matter stress tensor. On the other, viewed as an Euler- 
Lagrange equation, (12) is effectively 5I Tot /Sh^ = 0. Thus, we must have 5I M (ip)/5ip\o — 
SI M (h)/5h whose solution is clearly I M — 1 M {r}^ v + W), remembering that rj — > 77 + ip . 

Consistency has therefore led us to universal coupling, which implies the equivalence prin- 
ciple. It is at this point that the geometrical interpretation of general relativity arises, since all 
matter now moves in an effective Riemann space of metric g^ v = rf v + h^ v , and so the initial 
flat 'background' space rf v is no longer observable. 

Yang-Mills Field 

Consider now as an example of vector theories with internal symmetry, the Yang-Mills field, 
with SU2 invariance [7] . We begin with the linear system, a triplet of free massless vector fields 
with potentials A° and field strengths F£ v , where o=l, 2, 3 is the internal index. The first-order 
action 

h = -\! d 4 x[F^ ■ (dpA„ - d u A^ - I ■ F„„] (13a) 
yields the field equations 

O^F^ = , F^ = d^A v - d u A^ (13b) 

upon independent variation of A M and F MJ/ (we use vector notation for the isotopic index 
throughout). This set of free Abelian gauge fields is invariant under the usual Maxwell trans- 
formations of the second kind, A M — > A + d^A, F — > F which imply that d^F IJ,v is identically 
conserved. This property will require self-coupling (for consistency where sources are present). 
Its form is determined by the invariance under constant internal rotations 5 

e^e + exuj (14) 

where 8 stands for A or F. (The absence of such an invariance for the single real Maxwell field 
is responsible for its linearity.) The associated conserved current is 

= 5I/Sd^(x) = gF^ x A v (15) 

where the variable gauge transformations u{x) have just been introduced as a convenient means 
of obtaining the current according to the usual Noether theorem argument. If we now augment 

5 If we used the current corresponding to rotations about a particular direction only, the resulting theory would 
actually be inconsistent [8], so that the full symmetry must be exploited. 
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the quadratic action I with the self-coupling term j M • A M , which retains (constant) rotation 
invariance, we have 

I = I + J d 4 x^-A^ = I Q + \gf d 4 xA^- F^ x A v (16) 

with field equations 

= d^A v - d v A IJi + gAf, x A v 
d v F^ v = +gF liV x A„ = 3^ (17) 

As in our treatment of the general relativistic case, addition of the same generic cubic term in 
(16) yields the full theory without further iteration. This may be seen in two different ways: 
the self-interaction A • F x A does not involve explicit derivatives and hence will not contribute 
to a further j M term, as defined by (15). Alternately, the j M defined in (15) is already conserved 
as a consequence of the full equations (17), which are thus consistent as they stand: 

djj/i = g(F^^xA„ + F M1/ xA„ )M ) = . (18) 

The action (16) is, of course, the complete Yang-Mills action in first-order form. It is invariant 
under an extended group of gauge transformations of the second kind, although this was not 
required initially (as was the case for the corresponding general coordinate invariance of the 
full Einstein equations). This is a basic difference between the present and those derivations [9] 
which are based on the extended invariance requirements. 

The above derivation exhibited the Yang-Mills theory as one in which the isotopic current 
is the source of 9 M F M1/ , rather than of the linear expression d ll (df l A u — d^A^). Indeed, the 
two differ by a term 9^(A M x A v ) which is identically conserved, but not obtained from gauge 
invariance. This is just the converse of the Einstein situation, where t^ v was the source of the 
R^ v (h) rather than of T^ v a — (which is not identically conserved). 

Had we started from the second-order formalism, so that 

Jo = -\ J d*x{d»A. v - d v Arf (19) 

its invariance under 8 A = A x u> yields the current 

% = gA v x (<9 M A„ - d v A^) . (20) 

We would then expect to make the addition 

h = § J d 4 x !j • A = \ g J d 4 x A M • (fy A„ - d v A^ x A v . (21) 

Since equation (21) still involves explicit derivatives, it would yield one further iteration 

% = g 2 (A^ x A„) x A v , I 2 = \g 2 Jd 4 x(A^ x A„) 2 (22) 

in which there are no derivatives left. The total action is the familiar second-order form of 
Yang-Mills theory, 

I = h + h+h = -\ J d 4 xF-F , F M „ = D„A„ - D„A M (23) 

in terms of the "i-covariant" derivative D M = (d^ — \ gA^x). 

The final cubic (16) and quadratic (23) forms are of course equivalent. However, the proce- 
dure leading to the latter does not fulfill the original self-coupling postulate, since variation of 
the first iteration (21) does not yield field equations with 1 j M as source, but has the additional 
9„(A M xA„) term mentioned earlier. 
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Just as in the Einstein case, conservation of the current permitted, but did not require, 
self-interaction in the absence of sources. However, just as in the Einstein case, it is necessary 
in the only interesting situation, in which a dynamical current J p interacts with the field. For 
example for a fermion field ip, its current J = gip^d^Tip will not be conserved as a result of 
the Dirac equation, but will obey a 'covariant conservation' law, and so cannot be consistently 
coupled to the linear theory (since d^F^" = 0) even though J • A is rotationally invariant. It 
then becomes necessary to introduce self-interaction, that is transversality of the field equations 
with respect to covariant differentiation (D M D„F M,y = 0). Our argument is of course no longer 
compelling for a massive vector field, since the mass term can always absorb the non-conserved 
part of the current without need for non-linear terms, according to M 2 A v ^ = 3 U ^ ~ J„ x A v . 
However, it is still perfectly consistent to iterate and obtain the massive version of Yang-Mills 
theory. 

Spin Zero Systems 

Unlike the situation for tensor and vector fields, a non- linear theory is not mandatory for 
spinless particles, because as we shall see, there is no clash between external current non- 
conservation and the free field equations. However, it is still possible to carry out the same 
procedure, and insist that the source of the field be the total current, including that of the 
massless field itself. We consider here one example, 6 which leads to the Sugawara theory [11,12] 
of currents. 

Rather than obtain the chiral Lagrangian in one or another particular spin zero representa- 
tion, which would correspond to deriving the non-linearities of Einstein theory in a particular 
gauge, we shall reach it in a general form. To this end, consider the quadratic action 

Jo = - \ J d 4 x[F^ • (d„A v - 0„A„) + cA2] (24) 

which describes a triplet of purely longitudinal free massless fields with field equations 

0„A„ - dpA^ = (25a) 

dfiA^ = . (25b) 

This is the Abelian limit of Sugawara theory and is equivalent to a triplet of free massless scalar 
fields [since (25a) implies A M = d^cf>, and (25b) yields U<f> = 0]. If we now adjoin the current 
term, which is just I\ <~ J A ■ F x A, as in the Yang-Mills case (since the invariance is the same), 
we obtain 

J = Jo + /i = -| / d 4 x[F^ ■ (D^A, - D„A M ) + cA^] (26) 

where D M = - \ gA^ x . 

This action has been discussed elsewhere [13], and shown to be a Lagrangian formulation of 
the Sugawara model (extension to the SU2 x SU2 case is immediate). This is clear from the fact 
that the resulting field equations are the usual 

D^A U - D U A^ = d^Au - <9„A M - gA^ x A v = (27a) 

d^A" = (27b) 

and that the equal time commutation relations are also identical. The action is again only cubic, 
although it becomes an infinite series when expressed in terms of the spin zero pion field in one 
or another of the non-linear realizations of the field equations, corresponding to the choice of 



6 The simplest spin zero example, that of 'scalar gravitation', where a single scalar field is coupled to the trace of 
its stress tensor, is treated in second-order form in [13] and [10]. A later communication will deal with the Nordstrom 
theory. 



6 



basic pion field tt to represent the solution of (27a). The same procedure could be attempted 
starting with the scalar representation 

io = -J dM^-^-s* 2 ] ( 28 ) 

of a triplet of massless pions, and using the invariance under Stt = 7r xu , 5<f> = <(>xuj + uj' [App. 
B]. However, this is considerably more complicated than the above representation-independent 
treatment and the self-coupling prescription is not directly fulfilled at each step. Note that 
there is no necessity here of adding the non-linear term, since the original field equations in the 
presence of an external current J M (coupling J • A), 

= 0„A„ - 0„A„ , d^W v = cA v + J„ (29) 

allow for non-conserved 3 V , with OA ~ D(f> ~ <9J, as for massive vector theory. Unlike the 
transverse Yang-Mills field, this purely longitudinal gauge field does not possess the initial 
gauge invariance A — > A + dA which required the non-linearity there. It is, however, natural 
to add the cubic term so that covariant derivatives enter throughout. 

Appendix A 

We sketch here an amusing 'geometrical derivation' of the Einstein or Yang-Mills equations 
which is, however, less compelling than that in text. Consider the free spin 2 equations in terms 
of the variables 7^3, (f)^ (where M „ is ultimately g^ v — rj^ u ). Then the field equations read 

2 7^, a ~ 7m," - 7",m = 

1% = I^IW + ^m-^]- (A-l) 

In a background metric space they would then have the same form, but with all derivatives 
replaced by covariant ones with respect to the background, and with 77 — > g. We now identify 7 
and <f> as small variations of the background T and g. Then these covariant equations 'integrate' 
to the usual Einstein ones (3), using the Palatini identity 

SR^ = (5T« v ). a - \{5T^), V - ±(5T v ). tfl 

together with the obvious one for ST" and recalling that ST, unlike T itself, is a tensor. In 
particular, (A.l) thus represents the small oscillations of the Einstein field near flat space or with 
respect to a local inertial frame where T = and g — 77. Likewise we could start with the 'flat 
space' equations (13b) in terms of small oscillations , a„ which, in the presence of an external 
A p field, take 'the minimal' form with — > (<9 M — gA^x). Identifying f = 5(F), a = S(A) and 
integrating then yields precisely the full Yang-Mills equation (17). In the gravitational case, this 
argument is strengthened by the fact that the massless spin two field equations in curved space 
are in general inconsistent [15]; thus the identification <f), 7 — > 5(g), 5(T) is the only logical one. 

Appendix B 

We describe here the result of the iteration procedure on a triplet of massless spinless par- 
ticles, starting from the scalar representation, rather than the vector one treated in text. The 
initial action, 

h = - J[^-d^-^7v 2 }d 4 x (B.l) 

is invariant under combined isotopic rotations of (tt, <j>) and also translations of d>. The isotopic 
rotations alone lead to iterated currents and self-interaction Lagrangians of the form 

ft = A"(t^ x 0) x . . . x , L n = X n (^ x tt) x . . . x 4> tll (B.2) 
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Using vector product identities, this sums to 

I = - J d A x[^ • [1 + A 2 00 + x \<p] • 0^(1 + AVr 1 - \ 7T 2 ] (B.3a) 

j M = A(l + A 2 2 )- V M x + A(l + A 2 )" 1 ■ (00 + 2 1) • tt] . (5.3b) 

To reach the more familiar second-order form, we use the easily derivable equivalence between 
the first and second order actions 



d,cf> \ tt 2 ] «- I = i / jM • P 2 (0) • 0^ (B.4) 

for symmetric dyadics P(0). Then (B.3) become 

I = J>,„ • (1 + A 2 00) • 0,^(1 + A 2 ^ 2 )- 1 (B.5a) 

j„ - A0 iM x 0(1 + AV)- 1 . (B.5b) 

The conserved current (B.5b) is just the isotopic part of the usual chiral current, in the 
representation in which it reads 

j^[0, Al + A0x0^](l + A 2 2 )- 1 . (B.6) 

The <9^0 part could be obtained by introducing an initial a field and iterating on the combined 
initial chiral invariance. 7 

If we consider the combined rotations and translations of (B.l), with Scf> = x lj ± A _1 u; 
(which arc not quite of the chiral form) we get 

j;=^(x\<p±ir , L„=j^-0 M . (B.7) 

Using the identity 

oo 

^(xA0 ± 1)" = (x A0 ± I)" 1 = -(xA0 + 1)(1 + A 2 00)(1 + A 2 2 )- 1 (B.8) 
o 

these may be summed to yield 

I = - f[± Tr" • (1 + A 2 00) • M - Att^ x • 0^(1 + A 2 2 )-! - i tt 2 ] . (B.9) 
In second-order form, this reads simply 

1 = -\ I k • (i + A2 <w>) • + ^vr 1 ( s - i0a ) 

j" = [A0 x iM ± iM • (1 + A 2 00)](1 + A 2 2 )- x (B.lOb) 

with I = — \ J jfj, ■ j M . The above current differs from the usual chiral one by the multiplicative 
factor (1 x A 2 00), namely the extra 00 term in (B.lOb). Likewise, the Lagrangian jjjj^ differs 
from (B.lOa) by an extra denominator (1 + A 2 2 ) -1 in the (0 • M ) 2 term. This model is then a 
'dynamical theory of currents' different (in its current commutators) from the Sugawara model. 



7 One could also start from two triplets in (B.l), which would correspond to the SU2 x SU2 initial Sugawara form, 
namely a sum of two actions of the form (24). 
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